Stress-Testing General Purpose Digital Library Software

نویسندگان

  • David Bainbridge
  • Ian H. Witten
  • Stefan J. Boddie
  • John Thompson
چکیده

DSpace, Fedora, and Greenstone are three widely used open source digital library systems. In this paper we report on scalability tests performed on these tools by ourselves and others. These range from repositories populated with synthetically produced data to real world deployment with content measured in millions of items. A case study is presented that details how one of the systems performed when used to produce fully-searchable newspaper collections containing in excess of 20 GB of raw text (2 billion words, with 60 million unique terms), 50 GB of metadata, and 570 GB of images.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

General purpose medical digital library definition

1 The need of an approach for the definition of a platform-independent medical digital library, using only 2 open-source tools, will be described. To test the need and the success of such an approach, a library will 3 be created, which can later be used in a larger scale as a general purpose digital medical tool, when comes 4 the need to evaluate an image. 5 As a first test, the library will be...

متن کامل

General-Purpose Digital Library Content Laboratory Systems1

The last decade witnessed a proliferation of systems specially devised for aggregating and then operating over information objects – e.g., publications, experimental data, multimedia and compound objects – collected from possibly heterogeneous and autonomous data sources. Such systems, to which we refer as “Digital Library Content Laboratories”, are typically highly domain-specific and thus fea...

متن کامل

DIGITAL LIBRARIES: THE SYSTEMS ANALYSIS PERSPECTIVE Cataloging for the masses

Purpose – The purpose of this paper is to explore methods for opening up web content to automated classification using metadata, potentially in the context of library groupware or portals. Design/methodology/approach – Examines various web sites and meta-searching tools which provides a new means of access for users, and allow users to better document and integrate their research findings. Find...

متن کامل

DSPSR: Digital Signal Processing Software for Pulsar Astronomy

DSPSR is a high-performance, open-source, object-oriented, digital signal processing software library and application suite for use in radio pulsar astronomy. Written primarily in Cþþ, the library implements an extensive range of modular algorithms that can optionally exploit both multiple-core processors and general-purpose graphics processing units. After over a decade of research and develop...

متن کامل

Using Open Source Software for Digital Libraries: A Case Study of CUSAT

Purpose – The purpose of this paper is to describe the design and development of a digital library at Cochin University of Science and Technology (CUSAT), India, using DSpace open source software. The study covers the structure, contents and usage of CUSAT digital library. Design/methodology/approach – This paper examines the possibilities of applying open source in libraries. An evaluative app...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009